Simple Features for Chinese Word Sense Disambiguation

نویسندگان

  • Hoa Trang Dang
  • Ching-yi Chia
  • Martha Palmer
  • Fu-Dong Chiou
چکیده

In this paper we report on our experiments on automatic Word Sense Disambiguation using a maximum entropy approach for both English and Chinese verbs. We compare the difficulty of the sensetagging tasks in the two languages and investigate the types of contextual features that are useful for each language. Our experimental results suggest that while richer linguistic features are useful for English WSD, they may not be as beneficial for Chinese.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chinese Word Sense Disambiguation Based on Lexical Semantic Ontology

This paper describes preliminary works of word sense disambiguation on Chinese verbs using the information derived from lexical semantic ontology (LSO). In spite of sophisticated methods, simple algorithm is employed to underline the characters of the features chosen from LSO data. Several groups of tests are designed to find different effects of the features and other aspects. Some promising r...

متن کامل

رفع ابهام معنایی واژگان مبهم فارسی با مدل موضوعی LDA

Word sense disambiguation is the task of identifying the correct sense for the word in a given context among a finite set of possible sense. In this paper a model for farsi word sense disambiguation is presented. The model use two group of features: first, all word and stop words around target word and topic models as second features. We extract topics from a farsi corpus with Latent Dirichlet ...

متن کامل

The Effect of the Number of Features to Supervised Chinese Word Sense Disambiguation

Although feature selection is very important during either supervised or unsupervised word sense disambiguation processing, there is no systematic study on investigating the relationship between the number of features and the performance as we know yet. This paper investigates the effect of the number of features to supervised Chinese word sense disambiguation through thousands of experiments o...

متن کامل

SRCB-WSD: Supervised Chinese Word Sense Disambiguation with Key Features

This article describes the implementation of Word Sense Disambiguation system that participated in the SemEval-2007 multilingual Chinese-English lexical sample task. We adopted a supervised learning approach with Maximum Entropy classifier. The features used were neighboring words and their part-of-speech, as well as single words in the context, and other syntactic features based on shallow par...

متن کامل

Tor, TorMd: Distributional Profiles of Concepts for Unsupervised Word Sense Disambiguation

Words in the context of a target word have long been used as features by supervised word-sense classifiers. Mohammad and Hirst (2006a) proposed a way to determine the strength of association between a sense or concept and co-occurring words—the distributional profile of a concept (DPC)—without the use of manually annotated data. We implemented an unsupervised naı̈ve Bayes word sense classifier u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002